Hybridization Number on Three Trees
نویسندگان
چکیده
Phylogenetic networks are leaf-labelled directed acyclic graphs that are used to describe nontreelike evolutionary histories and are thus a generalization of phylogenetic trees. The hybridization number of a phylogenetic network is the sum of all indegrees minus the number of nodes plus one. The Hybridization Number problem takes as input a collection of phylogenetic trees and asks to construct a phylogenetic network that contains an embedding of each of the input trees and has a smallest possible hybridization number. We present an algorithm for the Hybridization Number problem on three binary phylogenetic trees on n leaves, which runs in time O(ckpoly(n)), with k the hybridization number of an optimal network and c some positive constant. For the case of two trees, an algorithm with running time O(3.18kn) was proposed before whereas an algorithm with running time O(ckpoly(n)) for more than two trees had prior to this article remained elusive. The algorithm for two trees uses the close connection to acyclic agreement forests to achieve a linear exponent in the running time, while previous algorithms for more than two trees (explicitly or implicitly) relied on a brute force search through all possible underlying network topologies, leading to running times that are not O(ckpoly(n)), for any c. The connection to acyclic agreement forests is much weaker for more than two trees, so even given the right agreement forest, the reconstruction of the network poses major challenges. We prove novel structural results that allow us to reconstruct a network without having to guess the underlying topology. Our techniques generalize to more than three input trees with the exception of one key lemma that maps nodes in the network to tree nodes and, thus, minimizes the amount of guessing involved in constructing the network. The main open problem therefore is to prove results that establish such a mapping for more than three trees. ∗Centrum Wiskunde & Informatica (CWI), Science Park 123, 1098 XG Amsterdam, The Netherlands, [email protected] †Leo van Iersel and Nela Lekić were respectively funded by Veni and Vrije Competitie grants from The Netherlands Organisation for Scientific Research (NWO). ‡Department of Knowledge Engineering (DKE), Maastricht University, P.O. Box 616, 6200 MD Maastricht, The Netherlands, [email protected], [email protected] §Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., PO Box 19024, Seattle, WA, USA 98109, [email protected] ¶Faculty of Computer Science, Dalhousie University, 6050 University Ave, Halifax, NS B3H 1W5, Canada, [email protected] 1 ar X iv :1 40 2. 21 36 v1 [ cs .D S] 1 0 Fe b 20 14
منابع مشابه
Hybridization Number on Three Rooted Binary Trees is EPT
Phylogenetic networks are leaf-labeled directed acyclic graphs that are used to describe nontreelike evolutionary histories and are thus a generalization of phylogenetic trees. The hybridization number of a phylogenetic network is the sum of all in-degrees minus the number of nodes plus one. The hybridization number problem takes as input a collection of rooted binary phylogenetic trees and ask...
متن کاملFast computation of minimum hybridization networks
MOTIVATION Hybridization events in evolution may lead to incongruent gene trees. One approach to determining possible interspecific hybridization events is to compute a hybridization network that attempts to reconcile incongruent gene trees using a minimum number of hybridization events. RESULTS We describe how to compute a representative set of minimum hybridization networks for two given bi...
متن کاملKernelizations for the Hybridization Number Problem on Multiple Nonbinary Trees
Given a finite setX, a collection T of rooted phylogenetic trees onX and an integer k, the Hybridization Number problem asks if there exists a phylogenetic network on X that displays all trees from T and has reticulation number at most k. We show two kernelization algorithms for Hybridization Number, with kernel sizes 4k(5k) and 20k(∆ − 1) respectively, with t the number of input trees and ∆ th...
متن کاملFast Computation of the Exact Hybridization Number of Two Phylogenetic Trees
Hybridization is a reticulate evolutionary process. An established problem on hybridization is computing the minimum number of hybridization events, called the hybridization number, needed in the evolutionary history of two phylogenetic trees. This problem is known to be NP-hard. In this paper, we present a new practical method to compute the exact hybridization number. Our approach is based on...
متن کاملCherry picking: a characterization of the temporal hybridization number for a set of phylogenies.
Recently, we have shown that calculating the minimum-temporal-hybridization number for a set [Formula: see text] of rooted binary phylogenetic trees is NP-hard and have characterized this minimum number when [Formula: see text] consists of exactly two trees. In this paper, we give the first characterization of the problem for [Formula: see text] being arbitrarily large. The characterization is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1402.2136 شماره
صفحات -
تاریخ انتشار 2014